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[57] ABSTRACT 

Two pipeline (255,233) RS decoders, one a time domain 
decoder and the other a transform domain decoder, use 
the same first part to develop an errata locator polyno- 
mial r(x), and an errata evaluator polynominal A(x). 
Both the time domain decoder and transform domain 
decoder have a modified GCD that uses an input multi- 
plexer and an output demultiplexer to reduce the num- 
ber of GCD cells required. The time domain decoder 
uses a Chien search and polynomial evaluator on the 
GCD outputs r(x) and A(x), for the final decoding 
steps, while the transform domain decoder uses a trans- 
form error pattern algorithm operating on r(x) and the 
initial syndrome computation S(x), followed by an in- 
verse transform algorithm in sequence for the final de- 
coding steps prior to adding the received RS coded 
message to produce a decoded output message. 
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ARCHITECTURE FOR TIME OR TRANSFORM 
DOMAIN DECODING OF REED-SOLOMON 
CODES 

5 

ORIGIN OF INVENTION 

The invention described herein was made in the per- 
formance of work under a NASA contract, and is sub- 
ject to the provisions of Public Law 95-517 (35 USC 
202) in which the Contractor has elected to retain title. 10 

BACKGROUND OF THE INVENTION 

This invention relates to a method and apparatus for 
finding the error locator polynomial required for de- 
coding a Reed-Solomon (RS) code, and more particu- 15 
larly to an efficient, VLSI implementable, time or trans- 
form domain RS decoder. 

By this invention, both the errata locator and errata 
evaluator polynomials can be obtained simultaneously 
and simply by the Euclidean algorithm. With these 20 
ideas, the complexity of both the time and transform 
domain Reed-Solomon decoders for correcting errors 
and erasures can be simplified. As a consequence, de- 
coders for correcting errors and erasures of Reed-Solo- 
mon codes can be made more modular, regular, simple, 25 
and naturally suitable for VLSI implementation. 

The Euclidean algorithm for solving the key equation 
for decoding BCH and Goppa type codes was disclosed 
by Sugiyama, et al., in Y. Sugiyama, M. Kasahara, S. 
Hirasawa and T. Namekawa, “A method for solving 30 
key equation for decoding Goppa codes,” IEEE Trans, 
on Contr., vol. 27, 1975, pp. 87-99. A fast decoding of 
RS codes using continued fraction which is closely 
related to the Euclidean algorithm was disclosed by I. 

S. Reed, R. A. Scholtz, T. K. Truong and L. R. Welch, 35 
“The fast decoding of Reed-Solomon codes using Fer- 
mat theoretic transforms and continued fractions,” 
IEEE Trans, on Information Theory , vol. IT-24, no. 1, 
January 1978., and L. R. Welch and R. A. Scholtz, 
“Continued fractions and Berlekamp’s algorithm,” 40 
IEEE Trans, on Information Theory , vol. IT-25, 1979, 
pp. 19-27. Brent and Kung in “Systolic VLSI arrays for 
polynomial GCD computation,” IEEE Trans, on Com - 
puters, vol. C-33, no. 8, August 1984, pp. 731-736, were 
the first to suggest a systolic array architecture for com- 45 
puting the greatest common divisor (GCD) of two 
polynomials. By the use of the above ideas, a pipeline 
structure for a transform domain decoder was devel- 
oped to decode errors of RS codes. See H. M. Shao, T. 

K. Truong, L. J. Deutsch, J. H. Yuen and I. S. Reed, “A 50 
VLSI design of a pipeline Reed-Solomon decoder,” 
IEEE Trans, on Computers , vol. C-34, no. 5, May 1985. 

An important ingredient of this design was a modified 
Euclidean algorithm for computing the error-locator 
polynomial. 55 

The computation of inverse field elements is com- 
pletely avoided in the above-mentioned modification of 
Euclid’s algorithm. Recently, it was proposed that a 
recursive algorithm could be used to perform this modi- 
fied Euclidean algorithm. H. M. Shao, T. K, Truong, I. 60 
S. Hsu, L. J. Deutsch and I. S. Reed, “A single chip 
VLSI Reed-Solomon decoder,” presented at Interna- 
tional Conference on Acoustics, Speech and Signal 
Processing, ICASSP, Tokyo, Japan, Apr. 7-11, 1986. 

An important advantage of that new recursive algo- 65 
rithm is that the entire systolic array needed to perform 
Euclid’s algorithm requires substantially less silicon 
area than the pipeline version of the modified Euclidean 


2 

algorithm, given in IEEE Trans, on Computers , May 
1985, cited above. 

Forney defined in “On decoding BCH codes,” IEEE 
Trans, on Information Theory \ vol. IT- 11, 1965, pp. 
549-557, an errata locator polynomial using what are 
now called Forney syndromes to correct both errors 
and erasures. Blahut showed, in Theory and Practice of 
Error Control Codes, Addison- Wesley Publishing Co., 
CA, May 1984, pp. 258, that the errata locator polyno- 
mial can be computed directly by initializing Ber- 
lekamp’s algorithm with the erasure locator polyno- 
mial. 

Recently W. L. Eastman, “Decoding erasures,” The 
Mitre Corporation Report, Bedford, MA, 1986 sug- 
gested that the errata evaluator polynomial can be com- 
puter directly by initializing Berlekamp’s algorithm 
with the Forney syndrome polynomial. By this tech- 
nique, it is possible to compute the errata locator poly- 
nomial and the errata evaluator polynomial simulta- 
neously from the Euclidean algorithm. This new RS 
decoder uses both the erasure locator polynomial and 
the Forney syndrome polynomial as initial conditions 
for the Euclidean algorithm. 

It is shown and proved by T. K. Truong, W. L. East- 
man, I. S. Reed, and I. S. Hsu, “A simplified procedure 
for correcting both errors and erasures of a Reed-Solo- 
mon code using the Euclidean algorithm,” submitted to 
Proc. IEE t 1987 that the modified Euclidean algorithm 
mentioned above can be used to solve the Berlekamp- 
Massey key equation for the errata locator polynomial 
and the errata evaluator polynomial directly and simul- 
taneously. By this means a new simplified pipeline ar- 
chitecture for both the time and transform domain de- 
coders can be developed for correcting both errors and 
erasures of RS codes. Such a decoding technique can be 
faster and simpler than previous methods. See I. S. 
Reed, T. K. Truong and R. L. Miller, “Decoding of 
B.C.H. and RS codes with errors and erasures using 
continued fractions,” Electronics Letters, vol. 15, no. 17, 
Aug. 16, 1976, pp. 542-544. I. S. Reed, T. K. Truong 
and R. L. Miller, “Simplified algorithm for correcting 
both errors and erasures of Reed-Solomon codes,” Proc. 
IEE, vol. 126, no. 10, October 1979, pp. 961-963. 

In this invention, it is found that the VLSI implemen- 
tation of the transform domain decoder is simpler than 
the time domain decoder. However, for a long RS code, 
due to the large size of the inverse transform unit 
needed in the transform decoder, the VLSI area needed 
to implement the transform domain decoder can be 
substantially larger than that needed for the time do- 
main detector. For moderately long codes, such as the 
(255,223) RS code used in the concatenated coding 
system for NASA’s Voyager mission, the transform 
domain decoder is still simpler than the time domain 
decoder. 

The above-mentioned NASA coding system is called 
the “baseline” system. It used a (7,i) convolutional code 
as its inner code and an 8-bit (255,223) RS code as its 
outer code. It is shown that this system achieves a bit- 
error rate (BER) of 10“ 6 at a bit signal-to-noise ratio 
(SNR) of 2.53 dB. See R. L. Miller, L. J. Deutsch and 
S. A. Butman, “On the error statistics of Viterbi decod- 
ing and the performance of concatenated codes,” Publi- 
cation 81-9, Jet Propulsion Laboratory, Pasadena, CA, 
September 1981. 

As noted above, the time domain decoder is more 
efficient in area than the transform domain decoder for 
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very long RS codes. One such example is the long, 10 
bits/symbol (1023,959) RS code being presently consid- 
ered for very deep space probes. If this code is concate- 
nated with a (15,1/5) convolutional code, it achieves a 
BER of 10 ~ 6 at a SNR of 0.5 dB. Evidently the new 5 
NASA concatenated coding system provides a 2 dB 
improvement over the present baseline system. See J. H. 
Yuen and Q. D. Vo, “In search of a 2-dB coding gain ,’ 9 
TDA Progress Report 42-83, Jet Propulsion Labora- 
tory, Pasadena, CA, July-September 1985. It is for this io 
reason and many other applications that it is important 
to develop an efficient, VLSI implementable, time- 
domain decoder to decode RS codes. 

An algorithm was developed by I. S. Reed, T. K. 
Truong and R. L. Miller, “Decoding of B.C.H. and RS 15 
codes with errors and erasures using continued frac- 
tions,” Electronic Letters, vol. 15, no. 17, Aug. 16, 1976, 
pp. 542-544 for time domain decoding RS codes to 
correct both errors and erasures by the use of continued 
fractions or its equivalent, Euclid’s algorithm. This 20 
algorithm is a modification of the Berlekamp-Fomey 
method. See G. D. Forney, “On decoding BCH codes,” 
IEEE Trans, on Information Theory , vol. IT-11, 1965, 
pp. 549-557, and E. R. Berlekamp, Algebraic Coding 
Theory , McGraw-Hill, 1968. In this algorithm, the con- 25 
tinued fraction algorithm is used to find the error loca- 
tor polynomial from the remainder of the formal power 
series for the Forney syndrome. The disadvantage of 
this algorithm is that after the error locator polynomial 
is obtained by continued fractions, two polynomial 30 
multiplications are needed to compute the errata locator 
polynomial and the errata evaluator polynomial from 
the known error locator polynomial. 

SUMMARY OF THE INVENTION 

35 

The present invention relates to a method and archi- 
tecture for time or transform domain decoding of RS 
codes. It corrects both errors and erasures in the time 
domain decoding of RS codes by the use of the Euclid- 
ean algorithm without separate computations of the ^ 
errata locator polynomial and the errata evaluator poly- 
nomial. This is possible because the errata evaluator 
polynomial A(x) and the errata locator polynomial r(x) 
can be obtained simultaneously from the Forney syn- 
drome polynomial defined by 

T(x)»S(x)A(x)modx^~ 1 

where 

S(x) - x( ^(jc) mod = M x)Mx) 

The procedure for a time domain decoder is composed 
of the following five steps using equations set forth in 
the description of preferred embodiments: 

(a) Compute the transform of the received m-tuple vec- 
tor over GF( 2 m ) from Eq. (2). Next calculate the 
erasure locator polynomial A(x) from Eq. (4a) and 
define deg {A(x)}=v. 

(b) Compute the Forney syndrome polynomial T(x) in 
Eq. (7). 

(c) To determine the errata locator polynomial r(x) and 
errata evaluator polynomial A(x), where 0=v=d-l, 
apply the Euclidean algorithm to x rf ~* and T(x) as 
given by Eq. (7). The initial values of the Euclidean 
algorithm are ro(x)=A(x), T31 i(x)=0, R 31 i(x)=x<*-* 
tfrt^i?o(x)— T(x), The recursion in Eq. (9) for R^x) and 
Ts(x) terminates when deg{R*(x)}^ |(d+v— 3)/2J 


4 

for the first time for some value s=s'. Finally, com- 
pute A(x) and r(x) from Eqs. (10a) and (10b). For 

v=d— 1, set r(x)=A(x) and A(x)=T(x). 

(d) Compute the errata values from Eq. (11). 

A transform domain decoder may also be used with 
this same method which differs only in the last step set 
forth above. For the time domain decoder, the errata 
locator polynomial t(x) is processed by a Chien search 
operation, and by a unit which computes [x 11 V(x)] -1 . 
The errata evaluator polynomial A(x), is used for a 
polynomial evaluation. The product of the errata evalu- 
ation polynomial and [x m T'(x)] _1 is computed as the 
magnitude of the error. The Chien search process is 
used to search for the error and erasure locations by 
finding the roots of r(x). For the transform domain 
decoder, the errata locator polynomial r(x) from the 
last step of the first part I and the syndromes from the 
initial syndrome computation from the first step of the 
first part I are subjected to a transform error pattern 
calculation which produces extended syndromes. The 
sequence of the error pattern is then inverted. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 A illustrates a block diagram of a pipeline 
(255,233) RS time domain decoder suitable for VLSI 
fabrication. 

FIG. IB illustrates a block diagram of a pipeline 
(255,233) RS transform domain decoder very similar to 
the time domain decoder in the first part I, but simpler 
in the second part II, although less suitable for VLSI 
fabrication in longer length RS codes. 

FIG. 2 is an example of the Euclidean algorithm used 
to find r(x) and A(x) in FIGS. 1A and IB. 

FIG. 3 is a block diagram of the syndrome computa- 
tion in FIGS. 1 A and IB. 

FIG. 4 is a block diagram of the power calculation in 
FIGS. 1A and IB. 

FIG. 5 is a block diagram of the power expansion in 
FIGS. 1 A and IB. 

FIG. 6 is a block diagram of the polynomial expan- 
sion in FIGS. 1 A and IB. 

FIG. 7A is a block diagram of the modified GCD in 
FIGS. 1A and IB. 

FIG. 7B illustrates the implementation of a cell in the 
modified GCD in FIG. 7A. 

FIG. 8 is a block diagram of the polynomial evalua- 
tion unit in FIG. 1A. 

FIG. 9 is a block diagram of the [x 111 t'(x )]“ 1 compu- 
tation unit in FIG. 1A. 

FIG. 10 is a block diagram of the transform error 
pattern computation unit in FIG. IB. 

FIG. 11 is a block diagram of the inverse transform 
unit in FIG. IB. 

FIG. 12 is a diagram for the systolic array for com- 
puting the transform error pattern in FIG. IB. 

DESCRIPTION OF PREFERRED 
EMBODIMENTS 

The Time Domain Decoder for RS Codes 

In this section, a new algorithm for correcting both 
errors and erasures in the time domain decoding of RS 
codes, shown in FIG. 1A, is described. In this new 
algorithm, the Euclidean algorithm is used to solve the 
Berlekamp-Fomey key equation for the errata locator 
polynomial and the errata evaluator polynomial di- 
rectly and simultaneously. The advantage of this new 
algorithm over previous methods discussed by I. S. 
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Reed, T. K. Truong and R. L. Miller, “Decoding of 
B.C.H. and RS codes with errors and erasures using 
continued fractions,” Electronic Letters, vol. 15, no. 17, 
Aug. 16, 1976, pp. 542-544, is that separate computa- 
tions of the errata locator polynomial and the errata 
evaluator polynomial usually needed can be avoided. 
This new decoding algorithm is highly suitable for both 
VLSI and software implementation. 

First, let GF(2 m ) be a finite Galois Field of 2 m ele- 
ments. Also, let N=2 m — 1 be the length of the (N,I) RS 
code over GF(2 m ) with minimum distance d, were 
I=N— (d— 1) denotes the number of m-bit message 
symbols and d— 1 denotes the number of parity symbols 
such that d— 1 is either an even or an odd integer. 
Define the following five vectors: 
e=(co,ci, . . . ,Qv-i), code vector, 
r=(ro,ri, — ,rjv-i)> received vector, 
e=(eo,ei, . . . ,eiv-i), error vector, 
u=(up,uu . . . ,117V- 1)» erasure vector, 
u=(uo,ui, . . . , ujv-i), errata vector. 

These vectors are related by u=e+u and r=c+u+e. 

Suppose that a number of errors (t) and erasures (v) 
occur in the received vector r, and assume that 
v+2t^d— 1. Next let a be a primitive element in the 
GF(2 m ). Then y=a l is also a primitive element in 
GF(2 m ), where (i,N)=l. 

To minimize the complexity of an RS encoder it is 
desirable that the generator polynomial by symmetric. 
If y is a root of the code’s generator polynomial, it is 
shown by E. R. Berlekamp, “Bit-Serial Reed-Solomon 
Encoders,” IEEE Trans, on Information Theory, vol. 
IT-28, no. 6, November 1982, pp. 869-874, that the 
generator polynomial g(x) is symmetric, if and only if 


b+id-2) 

*(*>= v (*-y9= * gpt, 
i = b '=0 
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Following Algebraic Coding Theory , define four differ- 
ent polynomials in the following manner. The erasure 
locator: 


5 


A<*) = 


7T (l.-jrj*)- l (1 -Xjx) = ii-iyAjxi 
XjeA j = 1 ;=° 


(4a) 


10 The error locator: 


(4b) 


15 * 


X(x) = 7T (1 - Xjx) = IT (1 - Xjx) « 2 (-1 )V 
Xj€\ j — l J=° 


The errata locator: 


20 r(x) « A(x)X(x) = tt ^(1 — Xjx) = JJ* (-1 yrjxi 


(4c) 


ro= 1. The errata evaluator: 


25 


A(x) 


' =l I'W J 


(4d) 


3 q In terms of the polynomials defined above, Eq. (3b) 
becomes 


35 


S(x)r(x) = A(x) + 


Yj^ +d ~ l ^(1 - Xpc) j 


(5) 


From Eq. (5), one obtains the congruence relation, 


where go=&f-l = l and b satisfies the equality 40 
2b+d— 2=2 m — 1. The syndromes of the code are 
given by 


AJ 1 ft | (?) 

S( [b-D+k = ,2 q urfW-'+Q = ,2 q (u 1 + e,-) y *»-»+*> 45 


S(x)T(x)*A(x)modx rf ~ 1 (6a) 

It is shown in T. K. Truong, W. L. Eastman, I. S. Reed, 
and I. S. Hsu, “A simplified procedure for correcting 
both errors and erasures of a Reed-Solomon code using 
the Euclidean algorithm,” submitted to Proc. IEE, 1987, 
that Eq. (6a) can be solved to yield 


= YjjA b -^ +k for 1 ^ k ^ d - 1 

j~ 1 


S{x) 


43- 

\(x)A(x) 


mod x ^~ 1 


(6b) 


where Xy is either the ] th erasure or error location, and 50 
Yyis either the ] th erasure or error magnitude. Define the 
set, A={X/|X/is an erasure location} and \={Xj |X/is 
an error location}. Define the syndrome polynomial as 


d~\ , . (3a) 

Six) - ^2 1 S(*_ !)+***“ 1 

Then it is not difficult to show (see E. R. Berlekamp, 
Algebraic Coding Theory , McGraw-Hill, 1968) that 


It is well known, e.g. see I. S. Reed, T. K. Truong and 
R. L. Miller, “Decoding of B.C.H. and RS codes with 
errors and erasures using continued fractions,” Electron- 
ics Letters, vol. 15, no. 17, Aug. 16, 1976, pp. 542-544., 
55 that the maximum number of errors in an RS code 
which can be corrected is l^d— 1— v)/2j where |_x] 
denotes the greatest integer less than or equal to x, i.e., 
the principal part of x, or expressed differently, it means 
“greatest integer =x,” discarding any fractions. In this 
60 expression L(d + v — 3)/2J , d is a fixed number (designed 
distance of code) and v is the number of error locations 
counted. Now define the Forney syndrome polynomial. 


Six) m 


d - 1 , . 

^3(6-1)+ ax*- 1 « 


(3b) 


DEFINITION 1 


65 The Forney syndrome polynomial is defined by 

v±t YjX* y±t YjXj*+*-' 

j— 1 (1 — Xjx) j— i (1 — Xjx) T(x)»S(x)A(x) mod x^ 1 (7) 
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7 

By Eq. (7), the key part of Eq. (6b) for \(x) and A(x) 
is: 


(8) 5 

where deg {X(x)} = |^d — 1 — v)/2] and deg 

{A(x)}^ (d-j-v— 3)/2 o It is shown in the following 
important theorem that the errata evaluator polynomial jq 
A( x) and the errata locator polynomial r(x) can be ob- 
tained simultaneously and simply from the known T(x) 
in Eq. (7), and the new key Eq. (8), which takes into 
account both errors and erasures. 

THEOREM 1 15 

Let T(x) in Eq. (7) be the Forney syndrome polyno- 
mial of a t-error and v-erasure correcting RS code 
under the condition v-f-2t^d— 1 where d— T is either 
an even or an odd integer. Consider the two polynomi- 20 
als M(x)=x rf ~ 1 and T(x)=sS(x)A(x) mod x^- 1 . Then the 
Euclidean algorithm for polynomials on GF(2 m ) can be 
used to develop two finite sequences R^(x) and r^x) 
from the following two recursive formulas: 

Ts(x)=(-%_ i(x))t s _ i(x)+r s _ 2 (x) 



and 

R s ( X )=R J _ 2 (x)-q J -i(x)R J _i(x) (9b) 30 

for (s=l,2 0 o. . )» where the initial conditions are 
to(x)=A(x) j t_i(x)=0, R_i(x)=M(x), and 

Ro(x)=T(x). Here q 5 -i(x) is obtained as the principal 35 
part of R,- 2 (x)/R,_i(x). The recursion in Eq. (9) for 
Rs(x) and r^x) terminates when deg 
{Rs(x)}^ [(d+v— 3)/2J for the first time for some 
value s=s'o Let 

40 

A(x)=Ry(x)/A (10a) 

and 

t(x)=ty(x)/A (10b) 45 

Also in Eq. (10b), A=r^(0) is a field element in GF(2 m ) 
which is chosen so that tq= 1. Then A(x) and r(x) in Eq. 
(10) are the unique solutions of 

50 

A(x)»T(x)r(x) mod \ d ~ 1 (10c) 

where both the inequalities, deg {r(x.){ — |(d+v— 1)/2J 
and deg { A(x)} ^ Ifd+v— 3)/2J, are satisfied. 

Proof: See T. K. Truong, W. L. Eastman, I. S. Reed, 55 
and I. S. Hsu, “A simplified procedure for correcting 
both errors and erasures of a Reed-Solomon code using 
the Euclideam algorithm/' submitted to Proa IEE, 
1987. 

CJX 

Theorem 1 is a proof that the idea in W. L. Eastman, 
“Decoding Erasures,” The Mitre Corporation Report, 
Bedford, MA, 1986 is correct. 

The roots of r(x) are the inverse locations of the t 
error and v erasures, referred to hereinafter as t and v, 55 
respectively. These roots are most efficiently found by 
the Chien search procedure. By Eq. (4d), it is shown 
readily that the errata values are 


-W 1 


for 1 S * S v 


+ t 


(ID 


where T'(xfc _1 ) is the derivative with respect to x of 
r(x), evaluated at x=X/t _1 . 

The overall time domain decoding of RS codes for 
correcting errors and erasures using the Euclidean algo- 
rithm is summarized in the following steps: 

(a) Compute the transform of the received m-tuple 
m-bit vector over GF(2 m ) from Eq. (2). Next calcu- 
late the erasure locator polynomial A(x) from Eq. (4a) 
and define deg {A(x)}~v. 

(b) Compute the Forney syndrome polynomial from 
T(x) in Eq. (7). 

(c) To determine the errata locator polynomial r(x) and 
errata evaluator polynomial A(x), where 0^ v<d— 1, 
apply the Euclidean algorithm to x d ~ l and T(x) as 
given by Eq. (7). The initial values of the Euclidean 
algorithm are ro(x) = A(x),r _ i(x) = 0,R _ i(x) =x d ~ l 
and Ro(x)=T(x). The recursion in Eq. (9) for R^x) 
and Tj(x) terminates when deg 
{RXx)}^ Kd+v—3)/2J for the first time for some 
value s=s'. Finally, compute r(x) and A(x) from Eqs. 
(10a) and (10b). For v=d— 1, set t(x)— A(x) and 
A(x)=T(x). 

(d) Compute the errata values from Eq. (11). 

To illustrate the time domain decoding procedure for 
correcting errors and erasures, an elementary example 
of an RS code over GF(2 4 ) is now presented. The repre- 
sentation of the field GF(2 4 ) generated by the primitive 
irreducible polynomial g(x)=x 4 -hx-hl is given in the 
following Table A. 
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0 

1 

0 

a 2 

0 

1 

0 

0 

a 3 

1 

0 

0 

0 

a 4 

0 

0 

1 

1 

a 5 

0 

1 

1 

0 

a6 

1 

1 

0 

0 

a 7 

1 

0 

1 

1 

a 8 

0 

1 

0 

1 

a 9 

1 

0 

1 

0 

a 10 

0 

1 

1 

1 

all 

1 

1 

1 

0 

al 2 

1 

1 

1 

1 

a!3 

1 

1 

0 

1 

a* 4 

1 

0 

0 

1 


EXAMPLE 1 

Consider a (15,9) RS code over GF(2 4 ) with mini- 
mum distance d=7. In this code, v erasures and t errors 
under the condition 2t+v^=d— 1 can be corrected. In 
order to simplify this example, let y =a and b= 1. Thus, 
the generator polynomial of such a (15,9) RS code is 
defined by 

6 

g(x) = 7T (x - a9 = 
i=l 

x 6 -f a 10 * 5 + a 14 * 4 + a 4 * 5 + a 6 * 2 + a 9 x + a 6 

Assume the message symbols are 
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9 

I(x)=a 10 x 14 +a l2 x 13 +a 8 x 12 +a 5 x 11 +a 6 x 10 + 

a 14 x 9 +a 13 x 8 +a u x 7 +a 9 x 6 . 


By Eqs. (10a) and (10b), one has 


The encoded codeword, which is a multiple of g(x), is 

c(x)=a 10 x 14 +a 12 x 13 +a 8 x 12 +a 5 x 11 +a 6 x 10 + 

a 14 x 9 +a 13 x 8 +a n x 7 +<z 9 x 6 +x 5 +- 

ax 4 +a 2 x 3 +a 6 x?+a 12 x+a 8 . 


Written as a vector, the codeword is 

10 

,a,a 2 ,a 6 ,a 12 ,a 8 ). 


Assume the erasure vector is 

u= (0,0, 0,0, 0,0,0, a 2 , 0,0, 0,0,0, 0,0) (12) 


r( x) = At n(x) = aV 4- a u x 2 4- a 2 x 4- 1 < 18) 

cr 

and 

A(x) = -V Mx) = oV + o 14 x + 1 (19) 

a 1 

By use of Chien search, the roots of r(x) constitute the 
set {a - 7 , a~ 3 , a- 10 }. The derivative with respect to x of 
r(x) in Eq. (18) is r'(x)=a 5 x 2 +a 2 . Thus, the errata 
values are 


and error vector is 

e= (0,0, 0,0, a 1 \ 0,0, 0,0, 0,0, a 7 , 0,0,0) 

Then the errata vector is 

u = u-f e = ( 0 , 0 , 0 , 0 ,a 1 0 , 0 , a 2 , 0 , 0 , 0 , a 7 , 0 , 0 , 0 ) 

Assume the received vector is 

r=c-f*u=(a 10 ,a 12 ,a 8 ,a 5 ,a,a 14 ,a 13 ,a 9 

,a 9 ,a°,a,a 12 ,a 6 ,a 12 ,a 8 ) 

The syndromes S* for r are 


(13) 


20 



( 15 ) 


30 


^f 1 ) Ala-* 
r'(o- 7 ) 



a 5 (a ~ 7 ) 2 4- a 14 (a -7 ) 4- 1 
a 5 (a ~ 7 ) 2 + a 2 

wr') 

a 5 (a -3 ) 4- a 14 (a“ 3 ) 4- 1 i 

II 

i 

L 

a 5 (a “ 3 ) 2 4 - a 2 


and 


a 5 (a -10 ) 2 4- a l 4 (a -10 ) 4-1 n 

’ iW 1 ) 

a 5 (a — l0 ) 2 4 - a 2 


Sk = 2 r„a nk = a 7 (a 3 )* + a 2 (a 7 ) k + a u (a 10 ) 4 for 1 S k S 6. 
n= 0 


This yields Si=a°, S 2 =a 13 , S 3 — a 14 , S 4 =a n , S s=a, 
andS 6 = 0 . Thus, the syndrome polynomial is S(x)=a°~ 
-ba 13 x+a 14 x 2 +a ll x 3 -bax 4 + 0 x 5 . 

The erasure locator polynomial is A(x)=(l +a 7 x). In 
this example, the maximum erasure correcting capabil- 
ity is Ifd — 1 — v)/ 2 J = 1(7— 1 — l)/Zl =2. By Eq. (7), 
one obtains the Forney syndrome polynomial as 

7(x) *■ ,4(x)S(x) * (1 4- a 7 x) (1 4- a 13 x + a 14 x 2 4- (16) 

a 11 * 3 + ax 4 + Ox 5 ) mod x 6 * (Ox 6 + c^x 5 + cAe 4 4- ax 3 4- 
a 12 x 2 + a 5 x 4- a 0 ) mod x 6 = a 8 x 5 + a 9 x 4 + 

ax 3 4- a 12 x 2 4- a 5 x 4- a 0 

In Eq. (16), the coefficients of T(x), To=a°, Ti=a 5 , 
T 2 =a 12 , T 3 =a, T 4 =a 9 , and Ts=a 8 are the Forney 
syndromes. 

The Euclidean algorithm is applied next to polyno- 
mial x d ~ 1 and T(x) in Eq. (16). By this means, polynomi- 
als t(x) and A(x) are determined next by use of the 
Euclidean algorithm. This is accomplished by the recur- 
sive Eqs. (9a) and ( 8 b) as illustrated in Table 1 shown in 
FIG. 2 where initially R_i(x)=x rf_ 1 =x 6 and 
Ro(x) = a 8 x 5 4* a 9 x 4 -f ax 3 -b a 12 x 2 + a 5 x + 1 . From Table 
1, it is seen that deg {Ry(x)}=deg 
{R 2 (x)} = 2 = L(d-hv— 3 )/H = 2. Thus, the computation 
terminates at this point s' = 2 , and 


THE TRANSFORM DECODER FOR RS CODES 

The transform decoder of Gore and Mandelbaum 
35 (W. C. Gore, “Transmitting binary symbols with Reed- 
Soloznon code,” John Hopkins EE Report 73-75, April 
1973, and D. Mandelbaum, “On decoding Reed-Solo- 
mon codes,” IEEE trans, on Information Theory , vol. 
IT-17, 1971, pp. 701-712) was developed further in I. S. 
40 Reed, T. K. Truong and R. L. Miller, “Simplified algo- 
rithm for correcting both errors and erasures of Reed- 
Solomon codes,” Proc. IEE, vol. 126, no. 10, October 
1979, pp. 961-963 to correct both errors and erasures. 
This decoding procedure was based on the algorithm 
45 orginally invented by Forney, “On decoding BCH 
codes,” IEEE Trans, on Information Theory \ vol. IT- 11, 
1965, pp. 549-557. (See also I. S. Reed, T. K. Truong 
and R. L. Miller, “Simplified algorithm for correcting 
both errors and erasures of Reed-Solomon codes,” Proa 
50 IEE ; vol. 126, no. 10, October 1979, pp. 961-963.) By 
the above-mentioned Euclidean algorithm, the trans- 
form domain decoding procedure in Proa IEE just cited 
can be simplified further. 

By the same procedure used in the time domain de- 
55 coder, one can obtain the errata locator polynomial 
given in Eq. (4c). Hence, 


T<x~ l ) = 1 + (-1)7107') + (-D 2 T2(*r‘) 2 + . . . + (20) 

60 (-l) , +'r v+ «(A7 1 ) v + t = 0 for 1 S i S v + t 

Multiplying Eq. (20) by Y/X,< 6 _1 )+* yields 


R. 2 (x) = a 7 x 2 -f ax + a 7 
and 


(17a) ^ naf~ 1)+ * - 


( 21 ) 

(_l)v+l Tv+I y^-l)+*-(»+0 = o 


T 2 (x) = a 7 x 3 4- a 13 x 2 4- a 4 x 4- a 2 


(17b) Summing Eq. (21) over i for l=i^v+t, produces 



11 


4,868,828 


12 


v£t Y rf b - l)+k _ n K;X p-l)+/c-l + 


The result is u= (0,0, 0,0, a 11 , 0,0, a 2 , 0,0,0, a 7 , 0,0,0). The 
( 22 ) corrected code is thus 


i=i 


i=i 


c — r — u = (a 10 ,a 12 ,a 8 ,a 5 ,a, a 14 ,a 13 ,a 9 ,a 9 ,a°,a, a 12 . 


(27) 


From Eq. (22), one has 


• • • (- 1 y + 'T,+, "II 1)+ ' Mv+0 = 0 a 6 a 12 i0 8) _ (0,0, 0,0, all, 0,0, a 2 0 , 0 ,0, a’, 0,0,0,) =(a>°, a 12 , 

a 8 ,a 3 ,a 6 ,a 14 ,a 13 ,a 11 ,a 9 ,a°,a f a 2 , a 6 , a 12 , a 8 ). 


S(b-\)+k - nS(b-i)+k-i + 


(23) 

. . . (- l ) H - f n+ ^_ 1Hft „ (H _ 0 = 0 


Hence, in general, 


15 


E(b-l)+k ~ r\E(b-X)+k-\ + ( 24 ) 

. . . (-iy+ v T t+v E(b-\)+k-(y+t) = 0 for k ^ d 

are the recursive equations for E^, the transforms of the 20 
errata pattern, where initially E/,=S&, E^+i— Sb+h * . . 

, E&+rf_ 2 =S& + </_ 2 are known from the prior syndrome 
calculation. 

From Eq. (24), one obtains the rest of the transform 
of u, i.e., the S/for O^l^N — 1. The amplitude u vector 25 
is found by taking the inverse transform over GF(2 m ) of 
S/s O^l^N— 1. Finally, the original m-tuple code vec- 
tor can be obtained by subtracting u from the received 
vector r. 

The above transform decoding algorith of RS codes 30 
for correcting both errors and erasures, using trans- 
forms over GF(2 m ) and the Euclidean aglorithm will 
now be recapitulated. This procedure is composed of 
the following five steps: 

(a) Use step (a) in the time domain decoder. 35 

(b) Use step (b) in the time domain decoder. 

(c) Use step (c) in the time domain decoder. • 

(d) Compute the rest of the transform of the errata 
vector by the use of Eq. (24). 

(e) Invert the transform to recover the errata vector 
using the fact that So = Sat. 

Then obtain the corrected code vector. 

To illustrate the transform domain decoder for cor- 
recting errors with erasures, the data for the (15,9) RS 
code over GF(2 4 ) used in Example 1 is again used. 

EXAMPLE 2 

Consider the (15,9) RS code over GF(2 4 ) with d=7. 

For this code, the erasure, error, errata, and received 5Q 
vector are given by Eqs. (12), (13), (14) and (15), respec- 
tively. By Eq. (18), the errata locator polynomial is 

r(x)=ro-hr][x+T 2 X 2 +T 3 X 3 = 1 -ha 2 x+a u x 2 +a 5 x 3 

% 

where ro=l, ri=a 2 , T 2 =a u , and T 3 =a 5 . 

By Eq. (23), the rest of the transform of the errata 
vector is 

Sk = a 2 S*_i + a il .Sfc _2 + a 5 Sfc_ 3 , for 7 ^ 15. 

That is, 87 = 013 , Sg=a 13 , S 9 =a 7 , Sio=a 3 , Sn=a 5 , 
Si 2 =a 13 , Si 3 =a 5 , Su=a 5 , and So=L The inverse 
transform of S* is 


15-1 . 

uk = 2 S„a~ nk for 0 ^ k ^ 14 

«— 0 


10 IMPLEMENTATION OF VLSI ARCHITECTURE 
OF THE TRANSFORM DECODER AND THE 
TIME DOMAIN DECODER 

In the description that follows of the various func- 
tional blocks in FIGS. 1A and IB, it should be under- 
stood that what is disclosed in those figures are digital 
systems which are customarily synchronized by a clock 
signal from a common source. It therefor follows that 
the various blocks to be described below are all syn- 
chronized by clock signals to registers, latches, detect- 
pors and the like. The synchronizing clock system is not 
shown in order that the figures not be cluttered with 
what those skilled in the art understand to be implied. 

FIG. 1A depicts a block diagram of a (255,223) RS 
time domain decoder. FIG. IB shows the block diagram 
of a (255,223) RS transform domain decoder. Each 
block diagram can be separated into two parts I and II, 
separated by dashed lines, as shown in both FIGS. 1A 
and IB. The first part I in both block diagrams has 
similar VLSI architectures. The major functional units 
in this part are: the syndrome computation unit 1 , the 
power calculation unit 2, the power expansion unit 3, 
the polynomial expansion unit 5 for the decoder of FIG. 
1A and 5 ? for the decoder of FIG. IB, and the 
[(d+v— 3)/2J generator 6 . Also included in this part 
are some delay registers la in FIG. 1 A and delay regis- 
ters la and lb in FIG. IB, and a modified GCD 8 . The 
lengths of the delay registers may not be equal in these 
two decoder architectures, but since they contain only 
replicated register cells, they can be considered identi- 
cal in architecture. 

FIG. 3 shows a block diagram of the syndrome com- 
putation 1. This unit accepts the received messages and 
computes their syndromes S(x). There are 32 syndrome 
cells in a (255,223) RS decoder. Each cell 1 , 2 ... 32 
depicted in FIG. 3 performs the operation as Sf*-(S,+- 
r/aOs where ” deotes the operation “is replaced by”. 
The three elements of each replicated cell perform the 
same operation. Element 11 is an adder, element 12 is a 
multiplier with a multiplicand constant a', where i is the 
number of the cell, and element 13 is an eight bit regis- 
ter. The Berlekamp multiplier is used in this syndrome 
unit for the multiplier 12 due to its simplicity in VLSI 
design. The computed syndrome polynomial is labelled 
55 as S(x) in both FIGS. 1A and IB. In the time domain 
and transform domain decoders, the coefficients of S(x) 
are fed in parallel to the polynomial expansion unit 5 or 
S', to compute the Forney syndromes. 

FIG. 4 shows the block diagram of the power calcu- 
( 25 > 60 lation 2 which converts the received Fs and 0 ’s into a 
sequence of a**s and 0 ’s, where a is a primitive element 
of the finite field over which the RS code is defined. 
The Fs and 0’s received at AND gate 21 indicate the 
occurrence or nonoccurrence, respectively, of an era- 
65 sure at a specific location. Since the maximum erasure 
(26) correcting capability of a (255,223) RS decoder is 32, 
only 32 symbol latches in block 22 are needed to store 
the locations of all the correctable erasures. However, 


40 


45 
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the output of the content of the 32 symbol latches is 
followed by 255-32 zeroes. A register R stores an 8-bit 
symbol which is read out in synchronism with each 
erasure location symbol into a multiple unit labeled 
“xa,” and then restored. When that recirculated synm- 5 


14 

out of the cells. By the time the fifth has been reached 
by the multiplexer, the first cell has completed the GCD 
computation, and the demultiplexer has read out r(x) 
and A(x), or just r(x), so the multiplexer may again 
address the first cell. In that way, the 5 cells are time 


bol is applied to the AND gate 21 in synchronism with shared for 32 GCD computations, 

an erasure location symbol, it produces an erasure loca- The inputs d(R/) and d(Q/) are degree inputs com- 
tion symbol bit a k in the kth position of the 32 erasure pared with I to determine if d(R/)=I or d(Q/)=I. If 
location symbols, shown as a 1 , a 3 , and a 4 . The other 32 either is greater than I, stop operation. Next compare 
symbol positions are filled with symbol zeros, as are the 10 d(R,) and d(Q/). If d(Q/) is greater than d(R/), all paired 
remaining 255 symbol positions after 32 erasure location inputs are switched as indicated schematically, other- 
symbols. wise paired inputs are passed on in the same channels as 

A zero detector 23 for detecting the occurrence of presented at the input terminals. Next there is a function 

erasures is included in the power calculation unit. If an of degree updating by subtracting one (1) from the 

erasure occurs at the k /A location, its corresponding 15 greater of the degree inputs. The last function in the 
symbol a k is calculated and stored in the proper latch degree channel is to verify that d(R/+i)=I or d(Q/)=iI. 
position. The sequence of a h s is fed to the polynomial If either is true, stop and output the data, and if not, 
expansion unit, to the power expansion unit, and to the increment j ifa) and allow recursion operation for an- 
|£d+v— 3)/2J generator. other cycle. That is repeated until the condition is satis- 

The power expansion unit 3 converts the a^s se- 20 fied. A synchronized “start” signal is delayed by a fixed 
quence into an erasure locator polynomial A(x) which amount which is the largest time expected for the cell to 
has a^s as its roots. FIG. 5 depicts the block diagram of perform its recursive function. The output of the delay 
this unit. The a**s are applied to thirty-two multipliers signal indicates to the demultiplexer that it is ready to 

31 in parallel, and to a nonzero detector 32 . For each have its outputs from the polynomial arithmetic units 

nonzero a ki detected, a gate 33 is opened to store the 25 read out, as shown schematically in FIG. 7B. This ar- 

sum of the produce P/x a ki and the previous sum of the chitecture for a single cell of this architecture is dis- 
previous register P/_ i. Each register Pi through P 32 is closed for a series connected set of 32 cells in a paper by 
an 8-bit register. The sum is formed by an adder 34 . Howard M. Shao, T. K. Truong, Leslie J. Deutsch, 

Note that the adder associated with the register Pi Joseph H. Yuen and Irving S. Reed, “A VLSI Design 

always adds zero to the product Pi x a ki . The output of 30 of a Pipeline Reed-Solomon Decoder,” IEEE Transac - 
the power expansion unit is the erasure locator polyno- tions on Computers, Vol. C-34, No. 5, May 1985. Here 
mial A(x) which is fed serially out of the register P 32 to fewer cells are being used on a time-sharing basis, 

the modified GCD (greatest common denominator cal- The polynomial A(x) together with the Forney syn- 
culator) unit 8 as one of the initial conditions. drome polynomial T(x) are two of the three inputs to 

The generator 6 is used to compute L(d-j-v— 3)/2J as 35 the modified GCD unit. The stop indicator for the Eu- 
noted hereinbefore. This is shown in both FIGS. 1A clidean algorithm capability of the code is computed by 

and IB. The output of this generator, which is imple- block 6 which provides the third input 

mented in a straight forward way to add d and v, sub- 1= [(d+v— 3)/21 . The output of the modified GCD 

tract 3 and divide by 2, is sent to the modified GCD unit unit is the errata locator polynomial r(x) and the errata 

8 and used as a stop indicator for the Euclid’s algorithm. 40 evaluator polynomial A(x). In operation of the GCD 
FIG. 6 shows a block diagram of the polynomial unit 8 , a multiplexer 81 receives all of the three inputs 

expansion circuit 5 . The Forney syndromes for either and distributes them to cells 1, 2 ... n, each of which is 

the time do main decoder or the transform decoder are a complete modified GCD unit. A demultiplexer 82 

calculated in this unit. It is organized very much like the connects the outputs A(x),r(x) and k of the cells in a 

power expansion unit 3 shown in FIG. 5 , except that 45 corresponding order. One further step is necessary for 
each 8-bit register S / of lower order is added through each r(x) output, which is to normalize r(x) polynomi- 

adder 51 to the 8-bit register S /+ 1 after multiplication by als by multiplying by the inverse of K obtained through 

a multiplier 54 in response to the output a ki when a an inverser 83 . A multiplexer 84 is provided for that 

nonzero is detected. A central parallel to serial con- function of normalizing. 

trol signal (PSi) causes the content of the register S/to 50 The differing functional units of the time and trans- 
be transferred to a register T/ when I has reached 32. form domain decoders are shown in the second parts of 

The register T/ is also an 8-bit register and is used for FIGS. 1A and IB labelled as II. One output of the modi- 

synchronized serial output of T(x) to the modified GCD fied GCD unit of the time domain decoder, the errata 

8. locator polynomial r(x), is fed to a Chien search unit 9 a 

FIG. 7A depicts the block diagram of the modified 55 which finds the root r(x) and to another unit 9b for 
GCD unit 8, and FIG. 7B depicts the architecture of computing [x*- !r'(x)] - 1 = [x 1 1 V(x)] - *, where b= 112 

one cell in FIG. 7A. As described previously by H. M. in this design. The other output of the modified GCD 

Shao, T. K. Truong, I. S. Hsu, L. J. Deutsch and I. S. unit of the time domain decoder, the errata evaluator 

Reed in “A single chip VLSI Reed-Solomon decoder,” polynomial A(x), is fed to the polynomial evaluation 

presented at the International Conference on Acoustics, 60 unit 10 to perform the evaluation of A(x). FIG. 8 shows 
Speech and Signal Processing, ICASSP, Tokyo, Japan, the block diagram of the polynomial evaluation unit 10. 

Apr. 7-1 1, 1986, a multiplexing scheme can be applied Each square block is an 8-bit register, and each circle is 

to the modified GCD unit to reduce the number of cells a Berlekamp multiplier with a fixed multiplicand a', 

needed. Normally the number of cells needed in this The input registers are Ao, Ai. ... A*, where n=v+t— 1 

example is 32 (the maximum number of erratas that can 65 and the output registers being summed are Aoa-°* 
be corrected), and they are normally connected in cas- Aia -1 * . . . A n a~ nk , where the a’s are precomputed 

cade. By using a smaller number of cells (n=5), it is and fixed. The summer provides the polynomial A(x- 

possible to multiplex and then demultiplex data into and )=A(a~( iV - i )), . . . A(a~ 2 ),A(a- l ),A(a~°). 
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The [x^T'fc)]" 1 unit 9a performs the calculation of 
one part of the errata magnitude. See H. M. Shao, T. K. 
Truong, I. S. Hsu, L. J. Deutsch and I. S. Reed, “A 
single chip VLSI Reed-Solomon decoder,” presented at 
International Conference on Acoustics, Speech and Signal 5 
Processing, ICASSP, Tokyo, Japan, Apr. 7-11, 1986. 
FIG. 9 shows the block diagram of this unit, where 
k=l, 2, . . . 255. Its implementation is straightforward 
with the use of an inverse element table. The product of 
the outputs of the polynomial evaluation unit and the 10 
[x 11 V(x)] -1 unit forms the errata magnitude. 

In the time domain decoder, the Chien search unit 9b 
is used to search for the error and erasure locations, as 
described in the paper just cited above. The implemen- 
tation of the Chien search unit is not shown because the 15 
Chien search is so well known. Basically it determines 
the roots of r(x). The preferred architecture of the 
Chien search unit is similar to that of the polynomial 
evaluation unit, except there is a zero detector at the 
end. 20 

For the transform domain decoder design, the output 
from the modified GCD unit is the errata locator poly- 
nomial t(x). This output is fed to the transform error 
pattern unit 9', along with the syndromes. S(x) from the 
syndrome computation unit 1, to calculate the extended 25 
syndromes. Operation of the new architecture for the 
transform error pattern unit 9' is described below. The 
realization of this idea is shown in the block diagram of 
the transform of the error-pattern unit, given in FIG. 

10. 30 

Referring to the block diagram of the transform of 
the error pattern, a gate Gi is opened by a control signal 
C, and a gate G 2 is held off by the complement C of the 
control signal. But first a set of 8 -bit latches Li through 
L 32 in cells 1 through 32 are loaded with the coefficients 35 
of r(x) in sequence with the first Li receiving the first 
coefficient To, the next receiving the next coefficient, 
until cell v+t receives the last coefficient t v -k-i. The 
rest of the flip-flops store a zero. Thus, t v +* represents 
the largest number n of errors in the error locator poly- 40 
nomial r(x). The output S(x) from the syndrome com- 
putation unit 1 is passed through cells 1 through 32 for 
multiplication by the coefficients in sequence by the 
separate multipliers Mi through M 32 . The products are 
added through adders Ai through A 32 , and the sums are 45 
stored in 8-bit registers Ri through R 32 After 32 symbol 
periods, Ri through R 32 each stores a sum of products, 
and gate G 2 is turned on for 255-32 symbol periods so 
that the output of register Ri is recirculated to the out- 
put terminal S'(x) to all of the multipliers Mi through 50 
M 32 - In that way, the registers Ri through R 32 are read 
out recursively and also fed back to each cell as inputs 
to reiterate the operation for an additional 255-32 cycles 
after the first 32 symbols pass to the output S'(x) in 
sequence. 55 

The transform error pattern S'(x) produced by the 
computation of extended syndromes r(x), together with 
the original syndromes S(x), is sent to the inverse trans- 
form unit 10' to obtain the estimated errata patterns. 
FIG. 11 shows the block diagram of the inverse trans- 60 
form. It is easy to see that the architecture for the in- 
verse transform is similar to that of the syndrome com- 
putation unit except that 255 cells are needed in the 
inverse transform unit while the syndrome computation 
unit needs only 32 subcells, and a parallel to serial regis- 65 
ter (Rq', Ri' • . . R' 254 ) is used. a~ n for each multiplier is 
constant and prestored for the inverse transform opera- 
tion. 


Clearly, the architecture of the transform domain 
decoder design is simpler than that of the time domain 
decoder design. This is because the transform domain 
decoder design needs only two regular function blocks 
and an adder 11' in part II, whereas the time domain 
decoder requires three function blocks plus a multiplier 
9c, and AND gate 9 d and an adder 11 to produce a 
decoded output. 

Furthermore, the inverse-transform unit 10 ' in the 
transform domain design contains 255 similar cells in 
the (255,223) RS decoder. It is estimated that these 255 
cells occupy only a moderate amount of silicon area, 
and that their geometric arrangement can be regular 
and simple. Therefore, time needed for design and test 
of such a VLSI chip can be saved substantially. How- 
ever, this advantage of the transform domain decoder is 
valid only for moderately short length RS codes. If long 
length RS codes are used to enhance the system's per- 
formance, the transform domain decoder nees a large 
inverse transform block. This might cause a problenm in 
the VLSI implementation. In general, if a GF(2 m ) field 
is used to define an RS code, an inverse transform block 
composed of 2 m — 1 cells is needed. Hence, the number 
of cells needed in an inverse transform block increases 
exponentially with the integer m. The number of tran- 
sistors needed in the time domain decoder goes up only 
linearly as the integer m increases. Therefore, for long 
length codes, the time domain decoder is the more ap- 
pealing approach. Although the computation complex- 
ity of the time domain decoder is more complex than 
that of the transform domain decoder, for long RS 
codes, the number of transistors needed in a time do- 
main decoder is substantially less than that in a trans- 
form domain decoder. 

EXAMPLE OF THE INVERSE TRANSFORM 
OPERATION 

In this example, a three-cell VLSI architecture is 
shown in FIG. 12 to compute the transform of the error 
pattern. Recently, Johnson et al., “Towards a formal 
treatment of VLSI arrays,” Caltech Conference on 
VLSI, January 1981 proposed a systolic array for com- 
puting a linear convolution. Using a technique similar to 
that suggested by Johnson et al., the transform of the 
error pattern in Eq. (23) or (24) can be converted to a 
recursive convolution formula and implemented in a 
systolic array. The advantage of this method over the 
previous method in H. M. Shao, T. K. Truong, L. J. 
Deutsch, H. H. Yuen and I. S. Reed, “A VLSI Design 
of a Pipeline Reed-Solomon Decoder,” IEEE Trans, on 
Computers, vol. C-34, no. 5, May 1985, is that the long 
delay needed in the large XOR tree used for summing 
all the terms in Eq. (23) is eliminated. Also the zero 
detectors needed in the previous design are not required 
in this new architecture. 

To illustrate the operation of this three-cell architec- 
ture shown in FIG. 12 , the data in Example 2 for a 
(15,9) RS code are used as an example. The recursive 
equation to compute the remainder of the transform of 
the error pattern is given in Eq. (25). Operation of the 
architecture for computing Eq. (25) is shown in FIG. 12 
with reference to just three cells and Table B. In this 
figure, the function of each cell can be described by a 
register transfer relation of the type R/<— Ri+i+S&a 1 *. 
The input data S/are sent to all the cells simultaneously 
and the output is taken from the register sequentially. 

To understand the operation of this circuit, assume 
initially that all registers R/for l=i=3 are set to zero. 
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The control signal C is high for 6 symbol clocks to 
allow data Si, S 2 , . . . , S 6 to be fed into the circuit. The 
input data are also sent to theoutput node. At the same 
time, the complement signal C of the signal C is low to 
prevent the data stored in register Ri from being sent to 5 
the output node. Note that one “clock time” for one 
Galois field symbol equals 4 circuit clock times. At the 
7th symbol clock time Jhe control signal C is switched 
to low or zero so that C= 1. Therefore, the data stored 
in register Ri, which equals S 7 at that moment, is sent to 10 
both the output node and fed back to all basic cells. This 
process continues until the rest of the transform of the 
error pattern, i.e., S 7 , Ss, . . . , S 15 are obtained. The 
detailed operation of this circuit is illustrated in Table 2 
below: 15 



Although particular embodiments of the invention 30 
have been described and illustrated herein, it is recog- 
nized that modifications and variations may readily 
occur those skilled in the art. Consequently, it is in- 
tended that the claims be interpreted to cover such 
modifications and variations. 35 

What is claimed is: 

1. A method for time or transform domain decoding 
of RS coded received messages which corrects both 
errors and erasures in decoding of RS coded messages 
by the use of the Euclidean algorithm without separate 40 
computations of the errata locator polynomial t(x) and 
the errata evaluator polynomial A(x) comprising the 
steps of 

(a) computing the transform of the received m-bit 
vector over the Galois Field GF(2 m ) from Eq. (2) 45 
for the initial syndrome computation, then calcu- 
late the erasure locator polynomial A(x) and de- 
fined deg {\(x)}=v, 

(b) computing the Forney syndrome polynomial T(x) 

from Eq. (7), 50 

(c) applying the Euclidean algorithm to x d ~ x and 

T(x) as given by Eq. (7) to determine the errata 
locator polynomial r(x) and errata evaluator poly- 
nomial A(x), using as initial values of the Euclidean 
algorithm ro(x)=A(x), t_i(x)=0, R_i(x)=x^“ 1 55 
and Ro(x)=T(x), wherein the recursion in Eq. (9) 
for Rj(x) and t^x) terminates when deg 
{Rj(x)}^ Kd+v— 3)/2J for the first time for some 
value s=s\ and then compute r(x) and A(x) from 
Eqs. (10a), and (10b), and 60 

(d) for time domain decoding, computing the error 
and erasure locations by finding the roots of the 
errata location polynomial r(x) by Eq. (4c), or for 
transform domain decoding, determining the error 
and erasure locations by subjecting the errata loca- 65 
tions polynomial r(x) and the syndrome S(x) from 
the syndrome computaton in step (a) to a transform 
error pattern calculation which produces extended 
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syndromes and inverting the sequence of the error 
pattern thus determined. 

2. A method as defined by claim 1 wherein step (c), 
for v—d— 1, t(x) is set equal to A(x) and A(x) is set 
equal to T(x). 

3. A method as defined by claim 1 or claim 2 for time 
domain decoding, where the last step (d) is comprised of 

(e) conducting polynomial evaluation of the errata 
evaluator polynomial A(x) and obtaining the values 
of polynomial A(x)=A(a~( iV ~ 1 )), . . . A(a~ 2 ), 
A(a -1 ), A(a-°), 

(f) computing from the errata locator polynomial r(x) 
the value (x^-^Xx))- 1 , where b is a constant 
chosen for the design of the decoder for a specific 
RS code, 

(g) determining from the errata locator polynomial 
t(x) the roots using a Chien search, 

(h) multiplying the result of step (e) with the result of 
step (f), 

(i) performing a logical AND between the polyno- 
mial results of step (h), and 

(j) adding the received message to the results of step 
(i) to produce a decoded output of the RS coded 
received message with errors and erasures cor- 
rected. 

4. A method as defined by claim 1 or claim 2 for 
transform domain decoding, where the last step (d) is 
comprised of 

(e) calculating from the errata locator polynomial 
r(x) and the syndromes from step (a), an extended 
syndrome S'(x) which constitutes a transform of 
the error pattern, 

(f) from the extended syndrome S'(x) produced in 
step (e) produce an inverse transform e/s, and 

(g) adding the received message to the inverse trans- 
form e/s of the transform error pattern to produce 
a decoded output. 

5. Apparatus for time domain decoding of received 
RS coded messages, which apparatus corrects both 
errors and erasures in decoding of RS codes comprised 
of 

syndrome means for initial syndrome computation of 
RS coded received messages for producing an 
extended syndrome s(x), 

means for erasure locator power calculation of era- 
sure location data received with RS coded re- 
ceived messages, 

means responsive to said syndrome s(x) of the initial 
received message for polynomial expansion of said 
initial syndrome to produce a Forney syndrome 
polynomial T(x), 

means for expansion of said erasure location power 
calculation to produce an erasure locator polyno- 
mial A(x), 

means for counting the number v of input erasures 
from said received RS coded messages, 

means responsive to said counting means for generat- 
ing the [d+v— 3/21 to produce a signal I, where d 
is a fixed number chosen for the designated RS 
code, and v is the number of erasure locations 
counted, 

means responsive to the signals T(x), A(x) and I to 
produce an errata location polynomial r(x) and 
errata evaluator polynomial A(x), 

polynomial evaluation means responsive to the errata 
evaluator polynomial A(x) for evaluation of said 
polynomial A(x) by producing the following val- 
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ues A(a“( jV “ 1 )), . . . A(a~ 2 ), A(a” 1 ) P A(a-°), 
where the multiplication of A(x) is in a Berlekamp 
multipher with the multiplicand a 1 fixed, where a is 
a primitive element in the Galois Field, 
value means responsive to said errata locator polyno- ^ 
mial r(x) for computing the value (x^V(x))” 1 , 
where b is a constant chosen for a specific RS code, 
means for determining the roots of the errata locator 
polynomial r(x) in order to determine the locations 
of errors and erasures, 

multiplication means responsive to said polynomial 
evaluation means and value means for forming 
resultant magnitudes, 

means for forming the logical AND of the result of 
said magnitude with said locations of errors and 
erasures, and 

means for adding said logical AND results to the 
original message received at said syndrome compu- 
tation means to form a decoded output constituting 
a restored message. 

6. Apparatus for transform domain decoding of re- 
ceived RS coded messages, said coded messages having 
a GF(2 m % where N=2 m is the length of the (N,I) RS 25 
code over GF(2 m ) with a minimum distance, d, which 
apparatus corrects both errors and erasures in decoding 
of received RS coded messages comprised of 
syndrome means for initial syndrome computation of 
said RS coded received messages for producing an 30 
extended syndrome s(x), 

means for erasure locator power calculation of era- 
sure location data received with said RS coded 
received messages, 

means responsive to said syndrome s(x) of the initial 
received message for polynomial expansion of said 
initial syndrome to produce a Forney syndrome 
polynomial T(x), 
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means for expansion of said erasure location power 
calculation to produce an erasure locator polyno- 
mial A(x), 

meams for counting the number v of input erasures 
from said received RS coded messages, 
means for generating the Id 4- v— 3/21 to produce a 
signal I, where d is a designed RS vector distance 
number chosen for the designated code, and v is 
the number of erasure locations counted, 
means responsive to the signals T(x), A(x) and I to 
produce an errata locator polynomial r(x) and 
errata evaluator polynomial A(x), 
means responsive to said errata locator polynomial 
t(x) and said syndrome S(x) of the initial received 
message for producing extended transforms S'(x) of 
the errata pattern t(x), 

means for inverting the transform S'(x) to recover the 
errata vector e/s, and 

combining the errata vector e/s with the RS encoded 
received message in an adder to produce a decoded 
output of the RS coded received message with 
errors and erasures corrected. 

7. Apparatus as defined in claim 6 wherein said means 
for producing extended transforms S'(x) of the errata 
pattern is comprised of 

a first switch for introducing symbols Si through S n 
to n cells, where each cell comprises 
a register for storing one preloaded and distributed 
value of r(x), 

a multiplier for multiplying each of said symbols Si 
to S n by said one preloaded and distributed value 
of r(x) they are introduced by said first switch, 
an adder for adding the product of said multiplier 
and accumulated products of a preceding cell 
35 except that the first cell n, where the cells are 

numbered in descending order from n to 1, 
which adds zero to said product, and 
a register for storing the output of said adder. 

* * * * * 
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